Search CORE

Edinburgh Research Explorer

A method for studying protistan diversity using massively parallel sequencing of V9 hypervariable regions of small-subunit ribosomal RNA genes

Author: A Chao
A Chao
C Quince
CD Sinigalliano
DT Kysela
E Pruesse
Elizabeth A. McCliment
F Zhu
GJ Olsen
Gordon Langsley
Hugh W. Ducklow
HW Ducklow
JA Huber
JD Neufeld
L Amaral-Zettler
L Medlin
Linda A. Amaral-Zettler
ML Sogin
NR Pace
PD Schloss
S Huse
S-M Lee
SM Huse
Susan M. Huse
W Ludwig
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2009
Field of study

© 2009 The Authors. This article is distributed under the terms of the Creative Commons Attribution License. The definitive version was published in PLoS ONE 4 (2009): e6372, doi:10.1371/journal.pone.0006372.Massively parallel pyrosequencing of amplicons from the V6 hypervariable regions of small-subunit (SSU) ribosomal RNA (rRNA) genes is commonly used to assess diversity and richness in bacterial and archaeal populations. Recent advances in pyrosequencing technology provide read lengths of up to 240 nucleotides. Amplicon pyrosequencing can now be applied to longer variable regions of the SSU rRNA gene including the V9 region in eukaryotes. We present a protocol for the amplicon pyrosequencing of V9 regions for eukaryotic environmental samples for biodiversity inventories and species richness estimation. The International Census of Marine Microbes (ICoMM) and the Microbial Inventory Research Across Diverse Aquatic Long Term Ecological Research Sites (MIRADA-LTERs) projects are already employing this protocol for tag sequencing of eukaryotic samples in a wide diversity of both marine and freshwater environments. Massively parallel pyrosequencing of eukaryotic V9 hypervariable regions of SSU rRNA genes provides a means of estimating species richness from deeply-sampled populations and for discovering novel species from the environment.This work was supported by grants from the W.M. Keck Foundation and the Woods Hole Center for Oceans and Human Health from the National Institutes of Health and National Science Foundation (NIH/NIEHS 1 P50 ES012742-01 and NSF/OCE 0430724-J) (LAZ and SH)

Public Library of Science (PLOS)

CiteSeerX

Woods Hole Open Access Server

Public Library of Science (PLOS)

Prospecting environmental mycobacteria: combined molecular approaches reveal unprecedented diversity

Background: Environmental mycobacteria (EM) include species commonly found in various terrestrial and aquatic environments, encompassing animal and human pathogens in addition to saprophytes. Approximately 150 EM species can be separated into fast and slow growers based on sequence and copy number differences of their 16S rRNA genes. Cultivation methods are not appropriate for diversity studies; few studies have investigated EM diversity in soil despite their importance as potential reservoirs of pathogens and their hypothesized role in masking or blocking M. bovis BCG vaccine. Methods: We report here the development, optimization and validation of molecular assays targeting the 16S rRNA gene to assess diversity and prevalence of fast and slow growing EM in representative soils from semi tropical and temperate areas. New primer sets were designed also to target uniquely slow growing mycobacteria and used with PCR-DGGE, tag-encoded Titanium amplicon pyrosequencing and quantitative PCR. Results: PCR-DGGE and pyrosequencing provided a consensus of EM diversity; for example, a high abundance of pyrosequencing reads and DGGE bands corresponded to M. moriokaense, M. colombiense and M. riyadhense. As expected pyrosequencing provided more comprehensive information; additional prevalent species included M. chlorophenolicum, M. neglectum, M. gordonae, M. aemonae. Prevalence of the total Mycobacterium genus in the soil samples ranged from 2.3×107 to 2.7×108 gene targets g−1; slow growers prevalence from 2.9×105 to 1.2×107 cells g−1. Conclusions: This combined molecular approach enabled an unprecedented qualitative and quantitative assessment of EM across soil samples. Good concordance was found between methods and the bioinformatics analysis was validated by random resampling. Sequences from most pathogenic groups associated with slow growth were identified in extenso in all soils tested with a specific assay, allowing to unmask them from the Mycobacterium whole genus, in which, as minority members, they would have remained undetected

Warwick Research Archives Portal Repository

FigShare

Microbial community composition in sediments resists perturbation by nutrient enrichment

Author: A Jayakumar
AC Tyler
BB Ward
BC Crump
Bess B Ward
BL Howes
CE Bagwell
CR Lovell
CS Hopkinson
G Braker
Hilary G Morrison
HW Paerl
I Valiela
I Valiela
I Valiela
I Valiela
Ivan Valiela
JA Fuhrman
JA Fuhrman
JA Huber
Jennifer L Bowen
JF Biddle
JL Bowen
JN Galloway
John E Hobbie
K Koop-Jakobsen
K Koop-Jakobsen
LA Deegan
LA Deegan
LD Brin
LE Osterman
Linda A Deegan
Mitchell L Sogin
ML Sogin
MR Hamersley
MS Rappé
NN Rabalais
PE Galand
SD Allison
SE Bulow
SM Huse
SM Huse
SM Huse
V Kunin
WG Zumft
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/11/2010
Field of study

Author Posting. © The Author(s), 2010. This is the author's version of the work. It is posted here by permission of Nature Publishing Group for personal use, not for redistribution. The definitive version was published in The ISME Journal 5 (2011): 1540–1548, doi:10.1038/ismej.2011.22.Functional redundancy in bacterial communities is expected to allow microbial assemblages to survive perturbation by allowing continuity in function despite compositional changes in communities. Recent evidence suggests, however, that microbial communities change both composition and function as a result of disturbance. We present evidence for a third response: resistance. We examined microbial community response to perturbation caused by nutrient enrichment in salt marsh sediments using deep pyrosequencing of 16S rRNA and functional gene microarrays targeting the nirS gene. Composition of the microbial community, as demonstrated by both genes, was unaffected by significant variations in external nutrient supply, despite demonstrable and diverse nutrient–induced changes in many aspects of marsh ecology. The lack of response to external forcing demonstrates a remarkable uncoupling between microbial composition and ecosystem-level biogeochemical processes and suggests that sediment microbial communities are able to resist some forms of perturbation.Funding for this research came from NSF(DEB-0717155 to JEH, DBI-0400819 to JLB). Support for the sequencing facility came from NIH and NSF (NIH/NIEHS-P50-ES012742-01 and NSF/OCE 0430724-J Stegeman PI to HGM and MLS, and WM Keck Foundation to MLS). Salary support provided from Princeton University Council on Science and Technology to JLB. Support for development of the functional gene microarray provided by NSF/OCE99-081482 to BBW. The Plum Island fertilization experiment was funded by NSF (DEB 0213767 and DEB 0816963)

Woods Hole Open Access Server

Analysis of 16S rRNA Amplicon Sequencing Options on the Roche/454 Next-Generation Titanium Sequencing Platform

Author: A Engelbrektson
A Lykidis
Ahmed Moustafa
Chiachi Hwang
Chris L. Wright
GD Wu
Hideyuki Tamaki
JR Cole
Jyothi Thimmapuram
M Palatinszky
MF Polz
ML Sogin
N Youssef
Q Wang
Qiaoyan Lin
SG Tringe
Shiping Wang
SM Huse
SM Huse
TJ Hamp
TM Schmidt
Wen-Tso Liu
Xiangzhen Li
Y Wang
Yoichi Kamagata
Z Liu
Z Liu
Publication venue: Public Library of Science
Publication date
Field of study

BACKGROUND: 16S rRNA gene pyrosequencing approach has revolutionized studies in microbial ecology. While primer selection and short read length can affect the resulting microbial community profile, little is known about the influence of pyrosequencing methods on the sequencing throughput and the outcome of microbial community analyses. The aim of this study is to compare differences in output, ease, and cost among three different amplicon pyrosequencing methods for the Roche/454 Titanium platform METHODOLOGY/PRINCIPAL FINDINGS: The following three pyrosequencing methods for 16S rRNA genes were selected in this study: Method-1 (standard method) is the recommended method for bi-directional sequencing using the LIB-A kit; Method-2 is a new option designed in this study for unidirectional sequencing with the LIB-A kit; and Method-3 uses the LIB-L kit for unidirectional sequencing. In our comparison among these three methods using 10 different environmental samples, Method-2 and Method-3 produced 1.5-1.6 times more useable reads than the standard method (Method-1), after quality-based trimming, and did not compromise the outcome of microbial community analyses. Specifically, Method-3 is the most cost-effective unidirectional amplicon sequencing method as it provided the most reads and required the least effort in consumables management. CONCLUSIONS: Our findings clearly demonstrated that alternative pyrosequencing methods for 16S rRNA genes could drastically affect sequencing output (e.g. number of reads before and after trimming) but have little effect on the outcomes of microbial community analysis. This finding is important for both researchers and sequencing facilities utilizing 16S rRNA gene pyrosequencing for microbial ecological studies

Global distribution and diversity of marine Verrucomicrobia

Author: A Kielak
A Pol
AC Martiny
Adam C Martiny
BP Hedlund
CJF ter Braak
CR Jackson
David B Mark Welch
DH Buckley
DH Buckley
E Pruesse
E Zaikova
GT Bergmann
H Schlesner
H Schlesner
H Urakawa
J Arnds
J Yoon
J Yoon
J Yoon
J Yoon
JA Fuhrman
Jed A Fuhrman
JG Caporaso
KA O’Farrell
KH Bostrom
L Zinger
M Allgaier
Mitchell L Sogin
MS Rappe
N Bano
P Hugenholtz
P Sangwan
PH Janssen
Sara Freitas
SJ Giovannoni
SJ Giovannoni
SM Huse
SM Huse
Stephen Hatosy
Susan M Huse
YJ Choo
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 08/12/2011
Field of study

Author Posting. © The Author(s), 2011. This is the author's version of the work. It is posted here by permission of Nature Publishing Group for personal use, not for redistribution. The definitive version was published in The ISME Journal 6 (2012): 1499-1505, doi:10.1038/ismej.2012.3.Verrucomicrobia is a bacterial phylum that is commonly detected in soil but little is known about the distribution and diversity of this phylum in the marine environment. To address this, we analyzed the marine microbial community composition in 506 samples from the International Census of Marine Microbes as well as eleven coastal samples taken from the California Current. These samples from both the water column and sediments covered a wide range of environmental conditions. Verrucomicrobia were present in 98% of the analyzed samples and thus appeared nearly ubiquitous in the ocean. Based on the occurrence of amplified 16S rRNA sequences, Verrucomicrobia constituted on average 2% of the water column and 1.4% of the sediment bacterial communities. The diversity of Verrucomicrobia displayed a biogeography at multiple taxonomic levels and thus, specific lineages appeared to have clear habitat preference. We found that Subdivision 1 and 4 generally dominated marine bacterial communities, whereas Subdivision 2 was confined to low salinity waters. Within the subdivisions, Verrucomicrobia community composition were significantly different in the water column compared to sediment as well as within the water column along gradients of salinity, temperature, nitrate, depth, and overall water column depth. Although we still know little about the ecophysiology of Verrucomicrobia lineages, the ubiquity of this phylum suggests that it may be important for the biogeochemical cycle of carbon in the ocean.We would like to thank the UCI Undergraduate Research Opportunity Program (S.F.), the National Science Foundation (OCE-0928544 and OCE-1046297, A.C.M.) and the Alfred P. Sloan Foundation (S.H., D.M.W., M.S.) for supporting the work

Woods Hole Open Access Server

SAMQA: error classification and validation of high-throughput sequenced read data

Author: A McKenna
B Langmead
DC Koboldt
H Li
H Li
H Li
J Dean
John Boyle
MJ Dunning
N Homer
PLF Johnson
R Pinard
Ryan Bressler
Sarah Killcoyne
SM Huse
T White
Thomas Robinson
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background The advances in high-throughput sequencing technologies and growth in data sizes has highlighted the need for scalable tools to perform quality assurance testing. These tests are necessary to ensure that data is of a minimum necessary standard for use in downstream analysis. In this paper we present the SAMQA tool to rapidly and robustly identify errors in population-scale sequence data. Results SAMQA has been used on samples from three separate sets of cancer genome data from The Cancer Genome Atlas (TCGA) project. Using technical standards provided by the SAM specification and biological standards defined by researchers, we have classified errors in these sequence data sets relative to individual reads within a sample. Due to an observed linearithmic speedup through the use of a high-performance computing (HPC) framework for the majority of tasks, poor quality data was identified prior to secondary analysis in significantly less time on the HPC framework than the same data run using alternative parallelization strategies on a single server. Conclusions The SAMQA toolset validates a minimum set of data quality standards across whole-genome and exome sequences. It is tuned to run on a high-performance computational framework, enabling QA across hundreds gigabytes of samples regardless of coverage or sample type.</p

Springer - Publisher Connector

Comparison of brush and biopsy sampling methods of the ileal pouch for assessment of mucosa-associated microbiota of human subjects

Author: A Chao
A Everard
A Nitsche
AM Eren
BJ Haas
C Dejea
E Pruesse
JP Wang
K Vipperla
L Öhman
MA Atkinson
MA Nadkarni
N Segata
R Core Team
RC Edgar
S Huse
S Pushalkar
SM Huse
SM Huse
TM Schmidt
V Leone
VB Young
VB Young
VT Marteinsson
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

arXiv.org e-Print Archive

Interpreting 16S metagenomic data without clustering to achieve sub-OTU resolution

Author: A Klindworth
A Shade
A Shade
AM Eren
BJ Haas
C Huttenhower
C Lozupone
C Quince
C Quince
DE Hunt
DN Fredricks
EK Costello
EK Costello
H Ochman
JG Caporaso
JG Caporaso
JI Prosser
JJ Faith
JL VandeWalle
JR Brestoff
M Hamady
MGI Langille
Mikhail Tikhonov
MJ Morgan
MJ Rosen
N Fierer
N Kamada
ND Youngblut
Ned S Wingreen
O Lukjancenko
PD Schloss
PD Schloss
PD Schloss
PJ Turnbaugh
RC Edgar
RC Edgar
RC Edgar
Robert W Leach
SJ Song
SM Huse
SP Preheim
TP Tourova
V Kunin
WJ Sul
Y Huang
ZJ Zheng
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 11/07/2014
Field of study

The standard approach to analyzing 16S tag sequence data, which relies on clustering reads by sequence similarity into Operational Taxonomic Units (OTUs), underexploits the accuracy of modern sequencing technology. We present a clustering-free approach to multi-sample Illumina datasets that can identify independent bacterial subpopulations regardless of the similarity of their 16S tag sequences. Using published data from a longitudinal time-series study of human tongue microbiota, we are able to resolve within standard 97% similarity OTUs up to 20 distinct subpopulations, all ecologically distinct but with 16S tags differing by as little as 1 nucleotide (99.2% similarity). A comparative analysis of oral communities of two cohabiting individuals reveals that most such subpopulations are shared between the two communities at 100% sequence identity, and that dynamical similarity between subpopulations in one host is strongly predictive of dynamical similarity between the same subpopulations in the other host. Our method can also be applied to samples collected in cross-sectional studies and can be used with the 454 sequencing platform. We discuss how the sub-OTU resolution of our approach can provide new insight into factors shaping community assembly.Comment: Updated to match the published version. 12 pages, 5 figures + supplement. Significantly revised for clarity, references added, results not change

Princeton University Open Access Repository

PanGEA: Identification of allele specific gene expression using the 454 technology

Author: AP Weber
Christian Schlötterer
ER Mardis
M Margulies
M Pop
O Gotoh
Robert Kofler
SC Schuster
SF Altschul
SM Huse
Tamas Lelley
Tatiana Teixeira Torres
TD Harris
TF Smith
TT Torres
W Brockman
WR Pearson
Z Ning
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background Next generation sequencing technologies hold great potential for many biological questions. While mainly used for genomic sequencing, they are also very promising for gene expression profiling. Sequencing of cDNA does not only provide an estimate of the absolute expression level, it can also be used for the identification of allele specific gene expression. Results We developed PanGEA, a tool which enables a fast and user-friendly analysis of allele specific gene expression using the 454 technology. PanGEA allows mapping of 454-ESTs to genes or whole genomes, displaying gene expression profiles, identification of SNPs and the quantification of allele specific gene expression. The intuitive GUI of PanGEA facilitates a flexible and interactive analysis of the data. PanGEA additionally implements a modification of the Smith-Waterman algorithm which deals with incorrect estimates of homopolymer length as occuring in the 454 technology Conclusion To our knowledge, PanGEA is the first tool which facilitates the identification of allele specific gene expression. PanGEA is distributed under the Mozilla Public License and available at: <url>http://www.kofler.or.at/bioinformatics/PanGEA</url></p

Springer - Publisher Connector